CDS
Accession Number | TCMCG019C36176 |
gbkey | CDS |
Protein Id | XP_022924551.1 |
Location | complement(join(1743528..1743758,1743926..1744660,1745297..1745599,1745709..1745769,1745874..1746017,1746106..1746362,1746462..1746674,1746925..1747098,1747252..1747319,1751464..1751522,1751708..1751783,1751871..1751976,1752078..1752187,1752368..1752532,1752623..1752708,1752948..1753012,1753240..1753287,1753372..1753491,1753607..1753682,1753826..1753923,1754032..1754173,1755195..1755310)) |
Gene | LOC111432000 |
GeneID | 111432000 |
Organism | Cucurbita moschata |
Protein
Length | 1150aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA418582 |
db_source | XM_023068783.1 |
Definition | DNA mismatch repair protein MSH1, mitochondrial isoform X3 [Cucurbita moschata] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGTACTGGGTGGCTACCCGAAACGTCGTTTCTTTCTCCCGGTGGCGTTTTTTGGCGCTTTTGATTGGCTTCCCTCCGCGCAACTTCACCCCATTTACTCACTCACCGGCGTTTTTTGAAAGGCAACAGCTTGAGAAGTTGCAGTTTGGAAAAGGTAGAAAATATTCAGGAGGAAGCATCAAAGCTGCTAAGAAGTTTAAAGATATTAATAATGTCCAAGACGATAAGTTCCTTTCTCACATTTCATGGTGGAAAGAGATGGTGGAATCATGCAAGAAACCGTCGTCGGTTCAGCTGGTTAAGAGGCTTGACTTCTCCAATTTGCTTGGTTTAGATATTAACCTGAAAAATGGGAGTCTTAAAGAAGGAACACTTAACTGGGAGATACTACAGTTCAAGGCAAAGTTTCCTCGAGAAGTTTTGCTTTGTAGAGTTGGAGATTTTTACGAAGCAATTGGAATAGATGCTTGCATACTTGTCGAATATGCTGGTTTGAATCCTTTTGGAGGTCAGCGTATGGATAGCGTTCCGAAAGCTGGTTGCCCTGTTGTGAATCTTCGTCAAACTTTGGATGATCTGACTCGTAACGGGTTCTCAGTGTGCATAGTGGAAGAAGTTCAAGGACCAATGCAAGCTCGTTCTCGCAAAGGACGTTTTATATCTGGGCATGCACACCCGGGCAGTCCCTATGTCTTTGGGCTTGTTGGGGTTGATCATGATCTCGACTTTCCAGAACCAATGCCTGTGGTCGGAATATCTCGATCTGCAAGAGGCTATTGCATAAGCCTTGTGATAGAGACCATGAAGACATATTCGTCAGAGGATGGTTTGACAGAGGAGGCCTTGGTTACTAAGCTGCGCACTTGTCAATACCATCATTTATTTCTTCACACTTCATTAAGGAACAACTCCTCAGGTACTTGTCGCTGGGGTGAATTCGGTGAGGGTGGTCGGCTATGGGGGGAATGTAATTCCAGACATTTCGAGTGGTTCGATGGAAATCCTCTTACTAATCTTTTGTCTAAGGTTAAAGATCTTTATGGTCTTGATGATGAAGTTACATTTAGGAACGTAACGATATCGTCCGAAAATAGGCCACATCCATTAACACTGGGAACTGCAACACAGATTGGTGCCATACCAACAGAGGGAATACCGTGTTTGTTGAAGGTGTTGCTTCCATCAAATTGTGCTGGCCTTCCTGCATTGTATATCAGGGATCTTCTTCTCAATCCTCCTGCTTATGAGATCGCGACCACTATTCAAGCAACATGCAGGCTTATGAGCAATGTCACATGTGCAATTCCAGACTTCACTTGCTTTCCACCCGCCAAGCTCGTGAAGTTACTGGAAATGAGGGAAGCCAATCATATTGAGTTCTGTAGAATGAAGAACGTACTCGACGAAATCTTACACATGCATAAAAATTGCGAGTTAAGCAATATCCTGAAATTGTTGATGGATCCTTCATCTGTGGCAACTGGGTTGAAAATTGACTACGATACATTTGTTGACAAATGTGAATGGGCTTCCAGTAGAGTTGGCGAAATGATTTTTCTCGATAATGAAAGCGAAAGCGATCAGAAAATCAATTCTTATTTTATCATTCCTAATGATTTTTTTGAGGATATGGAATCTTCTTGGAAAGGTCGTGTGAAAAGGATTCACATTGAAGAAGTGTGTACAGAAGTAGAAAGTGCAGCTGAAGCACTGTCTCTAGCAGTTACTGAAGATTTCGTCCCGATCATTTCAAGAATCAAGGCTACTACTGCGCCGCTAGGAGGTCCAAAGGGAGAAATATTGTATGCTCGGGATAATCAATCTGTCTGGTTCAAAGGAAGACGGTTTGCACCAGCTGTATGGGCTGGAAGCCCTGGAGAAGAAGAAATTAAACAATTGAAACCTGCTCTTGATTCAAAGGGTAAAAAGGTCGGGGACGAGTGGTTTACGACGAAGAAGGTGGAAGATGCTTTAACAAGGTACCAAGAGGCCAATGCCAAAGCAAAAGCAAGAGTAGTGGATTTGCTGAGGCAACTTTCCTCTGAATTGCTTGCTAAAATGAACGTTCTAATATTTGCTTCCATGTTACTCATTATCGCCAAGGCGTTATTCGCTCATGTGAGTGAAGGGAGGAGGAGAAAATGGGTTTTTCCTACCCTTGCTGCACCCAGTGATAGGTCCAAGCAGGGCAGGAAATCAATGGAGGGGAAGGTTGGGATGAAGCTGGTTGGACTATCTCCGTATTGGTTTGATGTGATAGAAGGGAATGCTGTGCAGAATAGTATTGAGATGGAGTCGTTGTTTCTTTTGACGGGTCCAAATGGGGGTGGGAAATCTAGCTTGCTTCGATCCATTTGTGCAGCTGCTTTGCTTGGGATATGTGGATTTATGGTGCCAGCAGAGTCTGCCCTGATTCCTCATTTTGATTCTATTATGCTTCATATGAAATCTTTTGATAGCCCTGCTGATGGGAAAAGTTCTTTTCAGGTGGAAATGTCAGAGATGAGATCCATCATGAGTAGAGCAACGGAAAGCAGCCTCGTACTTATAGATGAAATCTGTCGAGGAACAGAAACAGCAAAAGGCACTTGTATTGCAGGGAGCATTGTTGAAGCTCTTGATAAAGTTGGGTGCCTTGGCATTGTCTCCACTCACTTGCATGGTATATTCAATTTGCCTTTAGATATCAATAACACTGTGTTCAAAGCAATGGGAACTGTGTGTACTGATGGCCGAACGGTTCCCACTTGGAAGTTGATCGGTGGAATATGTAGAGAGAGCCTTGCCTTTGAAACAGCAAAGAATGAAGGAATCTGTGAAGCTATAATTCATAGGGCCCAAGATTTGTATCTCTCGAATTATGTTGAACAAGGGATTTCAGGAAAACAGAAGATGAATTTGTATCCCTCAAATTCTTCTCATGCAAGGCTTAATGGCAATGACAAACCCCATCTCCTGTCAAATGGTGTTACAGTAGAAGCTGAACGCCCAAAAACAGAGAAAACTAAGAAAAAGGTTGTCTCTTGGAAGGAAATTGAGGGTGCTATCACTGCAATATGCCAAAAGAAGCTGATAGAGTTTCATAAGGATAAAAACACATTGAAACCTGCAGAAATCCAATGTGTTTTGATTGATGCTAGAGAGAAGCCACCTCCATCCACAGTCGGTGCTTCGAGTGTGTATGTAATTCTTAGACCAGATGGTAAATTCTACGTCGGACAGACTGATGATCTAGAGGGTCGAGTCCATTCACATCGTTTAAAAGAAGGAATGCGGGATGCTGCATTTCTTTATTTTATAGTACCTGGGAAGAGCTTGGCATGCCAGCTTGAAACTCTTCTCATCAATCGACTTCCTGATCACGGGTTACAGCTAACTAATGTTGCTGATGGAAAGCACCGAAATTTTGGCACATCCAATCTCTTATCAGAGAATGTGACTGTTTGTTCATAA |
Protein: MYWVATRNVVSFSRWRFLALLIGFPPRNFTPFTHSPAFFERQQLEKLQFGKGRKYSGGSIKAAKKFKDINNVQDDKFLSHISWWKEMVESCKKPSSVQLVKRLDFSNLLGLDINLKNGSLKEGTLNWEILQFKAKFPREVLLCRVGDFYEAIGIDACILVEYAGLNPFGGQRMDSVPKAGCPVVNLRQTLDDLTRNGFSVCIVEEVQGPMQARSRKGRFISGHAHPGSPYVFGLVGVDHDLDFPEPMPVVGISRSARGYCISLVIETMKTYSSEDGLTEEALVTKLRTCQYHHLFLHTSLRNNSSGTCRWGEFGEGGRLWGECNSRHFEWFDGNPLTNLLSKVKDLYGLDDEVTFRNVTISSENRPHPLTLGTATQIGAIPTEGIPCLLKVLLPSNCAGLPALYIRDLLLNPPAYEIATTIQATCRLMSNVTCAIPDFTCFPPAKLVKLLEMREANHIEFCRMKNVLDEILHMHKNCELSNILKLLMDPSSVATGLKIDYDTFVDKCEWASSRVGEMIFLDNESESDQKINSYFIIPNDFFEDMESSWKGRVKRIHIEEVCTEVESAAEALSLAVTEDFVPIISRIKATTAPLGGPKGEILYARDNQSVWFKGRRFAPAVWAGSPGEEEIKQLKPALDSKGKKVGDEWFTTKKVEDALTRYQEANAKAKARVVDLLRQLSSELLAKMNVLIFASMLLIIAKALFAHVSEGRRRKWVFPTLAAPSDRSKQGRKSMEGKVGMKLVGLSPYWFDVIEGNAVQNSIEMESLFLLTGPNGGGKSSLLRSICAAALLGICGFMVPAESALIPHFDSIMLHMKSFDSPADGKSSFQVEMSEMRSIMSRATESSLVLIDEICRGTETAKGTCIAGSIVEALDKVGCLGIVSTHLHGIFNLPLDINNTVFKAMGTVCTDGRTVPTWKLIGGICRESLAFETAKNEGICEAIIHRAQDLYLSNYVEQGISGKQKMNLYPSNSSHARLNGNDKPHLLSNGVTVEAERPKTEKTKKKVVSWKEIEGAITAICQKKLIEFHKDKNTLKPAEIQCVLIDAREKPPPSTVGASSVYVILRPDGKFYVGQTDDLEGRVHSHRLKEGMRDAAFLYFIVPGKSLACQLETLLINRLPDHGLQLTNVADGKHRNFGTSNLLSENVTVCS |